On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?
Bookreader Item Preview
Share or Embed This Item
texts
On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?
On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?
ABSTRACT
The past 3 years of work in NLP have been characterized by the
development and deployment of ever larger language models, es-
pecially for English. BERT, its variants, GPT-2/3, and others, most
recently Switch-C, have pushed the boundaries of the possible both
through architectural innovations and through sheer size. Using
these pretrained models and the methodology of fine-tuning them
for specific tasks, researchers have extended the state of the art
on a wide array of tasks as measured by leaderboards on specific
benchmarks for English. In this paper, we take a step back and ask:
How big is too big? What are the possible risks associated with this
technology and what paths are available for mitigating those risks?
We provide recommendations including weighing the environmen-
tal and financial costs first, investing resources into curating and
carefully documenting datasets rather than ingesting everything on
the web, carrying out pre-development exercises evaluating how
the planned approach fits into research and development goals and
supports stakeholder values, and encouraging research directions
beyond ever larger language models.
The past 3 years of work in NLP have been characterized by the
development and deployment of ever larger language models, es-
pecially for English. BERT, its variants, GPT-2/3, and others, most
recently Switch-C, have pushed the boundaries of the possible both
through architectural innovations and through sheer size. Using
these pretrained models and the methodology of fine-tuning them
for specific tasks, researchers have extended the state of the art
on a wide array of tasks as measured by leaderboards on specific
benchmarks for English. In this paper, we take a step back and ask:
How big is too big? What are the possible risks associated with this
technology and what paths are available for mitigating those risks?
We provide recommendations including weighing the environmen-
tal and financial costs first, investing resources into curating and
carefully documenting datasets rather than ingesting everything on
the web, carrying out pre-development exercises evaluating how
the planned approach fits into research and development goals and
supports stakeholder values, and encouraging research directions
beyond ever larger language models.
AUTHORS
Emily M. Bender
University of Washington
Seattle, WA, USA
Timnit Gebru
Black in AI
Palo Alto, CA, USA
Angelina McMillan-Major
University of Washington
Seattle, WA, USA
Shmargaret Shmitchell
The Aether
University of Washington
Seattle, WA, USA
Timnit Gebru
Black in AI
Palo Alto, CA, USA
Angelina McMillan-Major
University of Washington
Seattle, WA, USA
Shmargaret Shmitchell
The Aether
ACM Reference Format:
Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, and Shmar-
garet Shmitchell. 2021. On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? . In Conference on Fairness, Accountability, and Transparency (FAccT ’21), March 3–10, 2021, Virtual Event, Canada. ACM, New York, NY, USA, 14 pages. https://doi.org/10.1145/3442188.3445922
Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, and Shmar-
garet Shmitchell. 2021. On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? . In Conference on Fairness, Accountability, and Transparency (FAccT ’21), March 3–10, 2021, Virtual Event, Canada. ACM, New York, NY, USA, 14 pages. https://doi.org/10.1145/3442188.3445922
- Addeddate
- 2022-06-13 15:10:11
- Doi
- 10.1145/3442188.3445922
- Identifier
- stochastic-parrots-3442188.3445922
- Identifier-ark
- ark:/13960/s23mvjnbvfs
- Ocr
- tesseract 5.1.0-1-ge935
- Ocr_detected_lang
- en
- Ocr_detected_lang_conf
- 1.0000
- Ocr_detected_script
- Latin
- Ocr_detected_script_conf
- 1.0000
- Ocr_module_version
- 0.0.16
- Ocr_parameters
- -l eng
- Page_number_confidence
- 100.00
- Ppi
- 300
- Scanner
- Internet Archive HTML5 Uploader 1.6.4
comment
Reviews
There are no reviews yet. Be the first one to
write a review.
1,099 Views
5 Favorites
DOWNLOAD OPTIONS
Temporarily Unavailable
For users with print-disabilities
IN COLLECTIONS
Community TextsUploaded by itshub on